CIRB030 (Chinese Information Retrieval Benchmark, version 3.0)
Abstract
An information retrieval (IR) test collection is used to evaluate the performance of IR systems. It is a useful tool for studying both systems under development and systems that are already built. The CIRB030 (Chinese Information Retrieval Benchmark, version 3.0) test collection is one such collection, designed for evaluating Chinese document retrieval. The CIRB030 CD-ROM contains 4 folders and 10 files; see Figure 1.
Similar resources
AINLP at NTCIR-6: Evaluations for Multilingual and Cross-Lingual Information Retrieval
In this paper, a multilingual cross-lingual information retrieval (CLIR) system is presented and evaluated in the NTCIR-6 project. We use language-independent indexing technology to process text collections in Chinese, Japanese, Korean, and English. Different machine translation systems are used to translate the queries for bilingual and multilingual CLIR. The experimental results...
Exploiting the LDC Chinese-English Bilingual Wordlist for Cross Language Information Retrieval
We investigated using the LDC English/Chinese bilingual wordlists for English-Chinese cross language retrieval. It is shown that the Chinese-to-English wordlist can be considered as both a phrase and word dictionary, and is preferable to the English-to-Chinese version in terms of phrase translation and word translation selection. Additional techniques such as frequency-based term selection, tra...
AINLP at NTCIR-6
In this paper, a multilingual cross-lingual information retrieval (CLIR) system is presented and evaluated in the NTCIR-6 project. We use language-independent indexing technology to process text collections in Chinese, Japanese, Korean, and English. Different machine translation systems are used to translate the queries for bilingual and multilingual CLIR. The experimental results...
CINDOR TREC-9 English-Chinese Evaluation
MNIS-TextWise Labs participated in the TREC-9 Chinese Cross-Language Information Retrieval track. The focus of our research for this participation has been on rapidly adding Chinese capabilities to CINDOR using tools for automatically generating a Chinese Conceptual Interlingua from existing lexical resources. For the TREC-9 evaluation we also built a version of our system which loosely integra...
A Hybrid Chinese Information Retrieval Model
A distinctive feature of Chinese text is that a Chinese document is a sequence of Chinese characters with no space or boundary between words. This feature makes Chinese information retrieval more difficult, since a retrieved document that contains the query term as a sequence of Chinese characters may not really be relevant to the query, because the query term (as a sequence of Chinese characters) may n...